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PROBLEM TO BE SOLVED: To provide a document reading 
system which can properly recognize documents without being 
affeciBd-by^writing means, such as the printer and pencil. 
SOLUTION: This document reading system converts the image 
of a slip read by means of a scanner 1 1 into multilevel image 
data and stores in an image memory 18 by means of an image 
inputting section ^^achlfi^of the image data is segmented 
by means of an im^gp-rSP.gmpntin g section 15 and binarized 
based on binarized thresho lds a^every field j-egistered in a slip 
layout data base 19 by means of a binarizing section 16. Binary 
image data stored in a binary image file 20 are read out and 
recogni zed by me ans of a recognitio nj processing section 17 . 
Since tlie^birTmlzed threshold corresponding to the writing 
means of each field is registered, proper recognition can be 
made. 
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CLAIMS 



[Claim(s)] 

[Claim 1] The image signal of the level corresponding to the concentration of each pixel is 
given from a reading meansjo^decompose into a pixel and to read the image information of a 
document which has^roor more entryfi^ds. An image input means to generate the image 
data of a multiple valueTbr this~evetyln>^l based on this image signal, An image storing 
means to store the image data of the multiple value generated with said image input means, A 
field information registration means by which the binary-ized threshold for making binary the 
image data of the positional information which showQheOgegrtoB^of two or more of said 
entry fields in said document, and this entry field was registered beforehand, The logging 
means which starts the image data of each of said entry field from said image storing means 
. with reference to said positional information, A binary-ized processing means to change and 
output the image data o^Qid entry field\started with said logging means to a binary image 
data according to the binary^i^^ of this entry field, The document reading system 

characterized by having a recognition processing means to recognize the image information of 
said entry field based on the binary image data outputted from said binary-ized processing 
means. 

DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the document reading system which reads the 

alphabetic character indicated by the document etc. and an image. 

[0002] 

[Description of the Prior Art] Drawing 2 is drawing showing an example of the document set 
as the reading object of a document reading system. This document is beforehand printed in 
the color [ the document code number which shows document classification to an upper left 
margin ] which can read black etc. Moreover, although identified by human being's eyes, a 
subject name and a preprinted guide line for an orange drop out color to show two or more 
blocks are beforehand printed by this document so that it may not be read by the machine. 
And in a preprinted guide line, it prints by the printer, or a predetermined matter is filled in 
with a black pencil, a black pen, etc., and is inputted into a document reading system. 
[0003] On the other hand, in the conventional document reading system, layout information, 
such as the format, i.e., the location of a preprinted guide line, the class of an alphabetic 
character and notation filled in, is beforehand registered into the document layout database for 
every document code number made into a reading object. And in a document reading system, 
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the document for reading is disassembled into a pixel, each pixel concentration is read 
optically, the read image data is changed into monochrome binary data using a fixed threshold, 
and it once accumulates in image image memory. Furthermore, a document layout database is 
searched with a document reading system according to the read document code number, the 
image data of the preprinted guide line for recognition is started from image image memory 
based on the layout information, and recognition processing is performed. 
[0004] 

[Problem(s) to be Solved by the Invention] However, the following technical problems 
occurred in the conventional document reading system. For example, since entry 
concentration differs for every entry means of those when the preprinted guide line which 
printed the alphabetic character etc. by the printer, the preprinted guide line which filled in the 
alphabetic character etc. in handwriting using the pencil etc., and the preprinted guide line 
which sealed with seal are intermingled like the document of drawing 2 , the pixel 
concentration of the image data read for every preprinted guide line differs. For this reason, it 
will be changed into an image data which is different in the actually indicated alphabetic 
character in the phase which makes binary the image data read optically, and there was a case 
where right recognition processing became impossible. This invention solves the technical 
problem which said conventional technique had, and is not influenced by the entry means, but 
offers the document reading system in which right recognition processing is possible. 
[0005] 

[Means for Solving the Problem] In order to solve said technical problem, this invention is set 
to a document reading system. The image signal of the level corresponding to the 
concentration of each pixel is given from a reading means to decompose into a pixel and to 
read the image information of a document which has two or more entry fields. An image input 
means to generate the image data of a multiple value for this every pixel based on this image 
signal, The binary-ized threshold for making binary the image data of the positional 
information which indicates the location of two or more of said entry fields in said document 
to be an image storing means to store the image data of the multiple value generated with said 
image input means, and this entry field is equipped with the field information registration 
means registered beforehand. Furthermore, a logging means by which this document reading 
system starts the image data of each of said entry field from said image storing means with 
reference to said positional information, A binary-ized processing means to change and output 
the image data of said entry field started with said logging means to a binary image data 
according to the binary-ized threshold of this entry field, It has a recognition processing 
means to recognize the image information of said entry field based on the binary image data 
outputted from said binary-ized processing means. 

[0006] According to this invention, since the document reading system was constituted as 
mentioned above, the following operations are performed. The image signal of the image 
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information of a document which has two or more entry fields read with the reading means is 
given to an image input means, and the image data of a multiple value is generated for every 
pixel. The image data of a multiple value is once stored in an image storing means for next 
processing. The image data stored in the image storing means is started per each entry field by 
the logging means based on the positional information which shows the location of the entry 
field registered into the field information registration means. The started image data is further 
changed into a binary image data by the binary-ized processing means based on the 
binary-ized threshold registered into the field information registration means. A binary image 
data is given to a recognition processing means, and the image information of the entry field 
is recognized. 
[0007] 

[Embodiment of the Invention] Drawing 1 is the document reading structure-of-a-system Fig. 
showing the operation gestalt of this invention. It has reading means (for example, scanner) 1 1 
for this document reading system to disassemble the document for reading (for example, 
document) into a pixel, and to read each pixel concentration optically, such as an image 
scanner and a digital camera, and this scanner 1 1 is connected to the image input means (for 
example, image input section) 12. The image input section 12 changes and outputs the image 
signal of the level corresponding to the concentration for every pixel given from the scanner 
1 1 to the image image data of the multiple value of 8 bits. The output side of the image input 
section 12 is connected to the system bus 13. Each processing means of the others and 
document discernment section 14, the logging means (for example, image logging section) 15, 
the binary-ized processing means (for example, binary-ized processing section) 16, and the 
recognition means (for example, recognition processing section) 17 is connected to the system 
bus 1 3 . [ section / 1 2 / image input ] Furthermore, each storage means of the image storing 
means (for example, image image memory) 18, -the . fie ld information regis tration means (for 
example, document layout database) 1 9, the binary image file 20, and the recognition result 
file 21 is connected to the system bus 13. Moreover, although not illustrated, the control 
section for controlling a series of actuation of each processing means is connected to the 
system bus 13. 

[0008] The image image memory 1 8 once stores the image image data of the multiple value 
of the document outputted from the image input section 12 for recognition processing etc. The 
document discernment section 14 identifies the classification of the read document based on 
the image image data stored in the image image memory 1 8. For example, if it is a document 
like drawing 2 , documen^assific^ 

recognition of the document codenumber indicatecTbylhe upper left margin^ / Moreover, if it is 
the document with which the document code number is not giyenjoa position, document 
classification will be identified base^on^e^dimension of the doc ument the description of the 
ruled line on a document, etc. The document layout databaseT9 is the file into which layout 



3 



i 



information / Avhret^shows 



fields (here, it is called the "field") divided with the preprinted guide line on the document etc. 
for every document classification, and a binary-ized threshold for making binary the image 
image data of each entry field, was registered beforehand. The image logging section 15 starts 
the image data of each field of the read document ( €^hyjMie)from the image image memory 

18 according to the positional information registered into the document layout database 19. 
[0009] The binary-ized processing section 16 changes and outputs the image data of the 
multiple value of 8 bits started in the image logging section 1 5 to an image data binary 

[ corresponding to white and black ] based on the binary-ized threshold registered into the 
document layout database 19. The binary image file 20 stores the binary image data outputted 
from the binary-ized processing section 16. The recognition processing section 17 performs 
recognition processing based on the image data stored in the binary image file 20^me field 
for [, such as an alphabetic character and a mark, ] recognition, and outputs the recognition 
result. The recognition result file 21 stores the recognition result recognized by the 
recognition processing section 17. 

[0010] Drawing 3 is drawing showing typically an example of the document layout database 

1 9 in drawing 1 . As shown in this drawing 3 , two or more fields set as the object of 
recognition are registered into the document layout database 1 9 for every document code 
number. And data, such as positional information (for example, address on the image memory 
corresponding to the coordinate of the diagonal point of the field etc.) of the field, information 
classification (for example, distinction of an image, an alphabetic character, a mark, etc.), and 
a binary-ized threshold, are beforehand stored to each of these fields. 

[0011] Drawing 4 is a flow chart which shows actuation of the document reading system of 
drawing 1 . Hereafter, actuation of drawing 1 is explained, referring to this drawing 4 . If it 
reads on the scanner 1 1 of the document reading system of drawing 1 and the target document 
is inputted, document reading processing of step SI of drawing 4 will be started with this 
scanner 11. In step SI, the image signal according to the concentration for every pixel of the 
document which was fe ^sembk xHn to the pixe l and read with the scanner 1 1 is given to the 
image input section 12. At step S2, in the image input section 12, an image signal is changed 
into the image image data of the multiple value which is 8 bits, and is stored in the image 
image memory 18 through a system bus 13. It progresses to step S3 after step S2. At step S3, 
the document discernment image data^storedjin the image 

image memory J 8 isjreferred to, and the document code number given to the position of a 
document is read.. Moreover, when it is the document with which the document code number 
is not given to a position, discernment of thread document is performed based on the 
documentjonnatdat^ with which the ^^ension^ F a document and the(^es^^^n)of a ruled 
toej^i^j^gister^ and the image image data stored in the image image memory 1 8. The 
document code numfteivof a discernment result is given to the image logging section 15, and 
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progresses to step S4. 

[0012] The image logging section 15 is started in step S4. The document layout database 19 is 
referred to by the image logging section 15, and the image data of the field unit for 
recognition is started from the image image memory 18. The started image data is given to the 
binary-ized processing section 16, and progresses to step S5. The binary-ized processing 
section 16 is started at step S5. The document layout database 19 is referred to by the 
binary-ized processing section 16, and the image data of a multiple value is changed into a 
binary image data based on a binary-ized threshold. A binary image data is stored in a binary 
image file, and progresses to step S6. At step S6, the document layout database 19 is referred 
to by the control section which is not (ihastratedTior ex^pleT^dTtls judged whether the 
information classification of the field is the information for [, such as an alphabetic character 
andanwk,,] recognition or it is the image information used as the candidate for recognition. 
If it is the information for recognition, it will progress to step S7, and if it is the image 
information used as the candidate for recognition, it will progress to step S8. 
[0013] The recognition processing section 17 is started at step S7. A binary image file is read 
by the recognition processing section 17, and recognition of an alphabetic character or a mark 
is performed. The data of a recognition result are stored in the recognition result file 21, and 
progress to step S8. At step S8, it is judged whether the document layout database 19 was 
referred to by the control section, and all processings of the field for recognition were 
completed by it. If the field whichhasjiot completed processing yet remains, processing of 
return and the remaining fields will be perforn^T^fep~S4~^ of all the 

fields is completed, reading processing of the document will be ended. 
[0014] Thus, the document reading system of this operation-gest^lt has the document layout 
data file 19 which registered the binary-ized threshold for gery fi ^ld for reading, and the 
binary-ized processing section 16 which makes the image data of a multiple value binary 
based on the binary-ized threshold. For this reason, since binary-ized processing can be 
performed with the suitable binary-ized threshold registered beforehand, respectively even if 
concentration, such as an alphabetic character indicated for every field, differs, it is not 
influenced by difference of entry means [ printer printing writing / pencil /, etc. ], but there is 
an advantage that right recognition processing can be performed. 

[0015] In addition, this invention is not limited to the above-mentioned operation gestalt, but 
various deformation is possible for it. As this modification, there is a thing like following (a) - 
(f), for example. 

(a) Although the image signal given from the scanner 1 1 is changed into the image image data 
of the multiple value of 8 bits in the image input section 12, not to be limited to 8 bits but 
what is necessary is just the multiple value of 2 bits or more. If there is little number of bits, it 
will become impossible however, to set up a fine threshold. 

(b) The thing in which the fields, such as an image with which the document for reading does 
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not serve as an alphabetic character for recognition etc. and a candidate for recognition, are 
intermingled, or the thing which consisted of only alphabetic fields for recognition can be 
read similarly. 

[0016] (c) If it is the case where only one kind of document is read, the document discernment 
section 14 can be deleted. 

(d) Although recognition processing by the recognition processing section 17 is performed 
once it stores the processing result of the binary-ized processing section 1 6 in the binary 
image file 20, it may be made to perform recognition processing immediately, without 
minding this binary image file 20. 

(e) Although he is trying to store the recognition result of the recognition processing section 

1 7 in the recognition result file 21 , you may make it transmit to a host computer etc. through a 
communication line etc. 

(f) Although it has the composition of having connected each processing section of image 
input section 12 grade by the system bus 13, software processing may be made to perform the 
same actuation as each of these processing sections using a computer. 

[0017] 

[Effect of the Invention] As explained to the detail above, according to this invention, it has a 
field information registration means by which the binary-ized threshold of the image data of 

each ^ntFy^teklgfth e document for reading was registered, and a binary-ized processing 

■ -i 

means to change an image data into binary from a multiple value according to the binary-ized 
threshold. It is effective in becoming possible to make it binary for every entry field with the 
threshold corresponding to the entiy means (for example, printer printing, pencil writing, etc.) 
by this, and right recognition processing being attained for every entry field. 

DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

r Drawing 1] It is the document reading structure-of-a-system Fig. showing the operation 
gestalt of this invention. 

[Drawing 2] It is drawing showing an example of the document set as the reading object of a 
document reading system. 

[Drawing 3] It is drawing showing typically an example of the document layout database 19 
in drawing 1 . 

[Drawing 4] It is the flow chart which shows actuation of the document reading system of 
drawing 1 . 

[Description of Notations] 

11 Scanner 

1 2 Image Input Section 
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14 Document Discernment Section 

1 5 Image Logging Section 

1 6 Binary-ized Processing Section 

17 Recognition Processing Section 

1 8 Image Image Memory 

1 9 Document Layout Database 

20 Binary Image File 

2 1 Recognition Result File 



